nvme: remove redundant structure of primary controller capabilities
A recently merged PR introduced a redundant structure named
"nvme_pri_ctrl_caps", duplicating the existing structure
"nvme_primary_ctrl_caps", which is already in use.
Max Gurtovoy [Thu, 1 Apr 2021 08:38:49 +0000 (08:38 +0000)]
nvme: add NVIDIA plugin for NVMe SNAP controllers
SNAP technology is supported on NVIDIA Mellanox BlueField DPU family
and combines a unique hardware-accelerated virtualization of NVMe
devices with the advanced networking and programmability capabilities of
the BlueField SmartNIC.
Add initial vendor specific parsing for id-ctrl command of NVMe SNAP
controllers.
For more info: https://www.mellanox.com/products/software/nvme-snap
Signed-off-by: Max Gurtovoy <mgurtovoy@nvidia.com>
[merged up with "new" json usage]
Signed-off-by: Keith Busch <kbusch@kernel.org>
Hannes Reinecke [Wed, 24 Mar 2021 07:22:28 +0000 (08:22 +0100)]
Read system UUID from DMI and merge hostnqn generation functions
Both fabrics.c and the 'gen-hostnqn' command contain functions
to generate a default host NQN. Needless to say, both are different.
_And_ we have the script 'nvme-gen-hostnqn' to read the host NQN
from the system UUID, implementing yet another way.
This patch merges all of these functions, and implements a hierarchy
in generating host NQNs:
- Try reading system UUID from /sys/firmware/dmi/entries
- Try reading systemd-generated UUID
- Generate a random UUID
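The fallback hierarchy above can be sketched as an ordered list of UUID
sources, where the first source that succeeds provides the host NQN. This is
an illustration only, not the code from the patch: uuid_source_fn,
uuid_unavailable(), uuid_random() and hostnqn_generate() are hypothetical
names, and the DMI and systemd readers are represented by a stand-in that
always fails.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

#define NQN_UUID_PREFIX "nqn.2014-08.org.nvmexpress:uuid:"
#define UUID_STR_SIZE 37 /* 36 characters plus the terminating NUL */

/* A source stores a UUID string in buf and returns 0, or returns -1. */
typedef int (*uuid_source_fn)(char *buf, size_t len);

/* Hypothetical stand-in for a source that has nothing to offer, e.g. a
 * system without readable DMI entries or without a systemd UUID. */
static int uuid_unavailable(char *buf, size_t len)
{
	(void)buf;
	(void)len;
	return -1;
}

/* Last resort: a random version-4 UUID. rand() keeps the sketch short;
 * it is not a suitable randomness source for real use. */
static int uuid_random(char *buf, size_t len)
{
	unsigned char b[16];
	int ret;

	for (size_t i = 0; i < sizeof(b); i++)
		b[i] = (unsigned char)(rand() & 0xff);
	b[6] = (b[6] & 0x0f) | 0x40; /* version 4 */
	b[8] = (b[8] & 0x3f) | 0x80; /* RFC 4122 variant */

	ret = snprintf(buf, len,
		       "%02x%02x%02x%02x-%02x%02x-%02x%02x-%02x%02x-"
		       "%02x%02x%02x%02x%02x%02x",
		       b[0], b[1], b[2], b[3], b[4], b[5], b[6], b[7],
		       b[8], b[9], b[10], b[11], b[12], b[13], b[14], b[15]);
	return (ret == UUID_STR_SIZE - 1 && (size_t)ret < len) ? 0 : -1;
}

/* Walk the sources in priority order; the first success wins. */
static int hostnqn_generate(const uuid_source_fn *sources, size_t n,
			    char *nqn, size_t len)
{
	char uuid[UUID_STR_SIZE];

	for (size_t i = 0; i < n; i++) {
		if (sources[i](uuid, sizeof(uuid)) == 0) {
			int ret = snprintf(nqn, len, NQN_UUID_PREFIX "%s", uuid);

			return (ret < 0 || (size_t)ret >= len) ? -1 : 0;
		}
	}
	return -1;
}
```

A caller would pass the sources in priority order (DMI reader, systemd
reader, random generator), so adding or reordering a source does not touch
the NQN formatting.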
Hannes Reinecke [Sat, 20 Mar 2021 08:21:25 +0000 (09:21 +0100)]
nvme: build against json-c library
Allow using the json-c library instead of our own hand-crafted
json routines. This allows us to add more functionality like
parsing json files in the future.
Martin Wilck [Sun, 17 Jan 2021 21:40:10 +0000 (22:40 +0100)]
fabrics: export symbols required for monitor functionality
These macros, functions and variables will be used by the "nvme monitor"
functionality. Convert them to globally visible symbols.
Being able to access struct config and the "cfg" variable
from fabrics.c is essential for the monitor to leverage the existing,
well tested code as much as possible. Rename "cfg" to "fabrics_cfg"
to make the global variable name less generic.
Martin Wilck [Fri, 5 Mar 2021 22:51:31 +0000 (23:51 +0100)]
fabrics: fix invalid memory access in discover_from_conf_file()
argconfig_parse() assigns pointers in cfg to point to memory allocated
in all_args. If this memory is freed, these pointers become dangling.
This is particularly dangerous if discovery.conf contains empty lines,
comment lines, or invalid lines.
Fix it by setting all transport parameters to NULL after processing each
line, and not proceeding if the basic parameters aren't set.
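A minimal sketch of the pattern, assuming a simplified line format of
"--key=value" tokens. The struct and function names here (transport_cfg,
parse_line(), count_usable_lines() and so on) are hypothetical and do not
mirror the real argconfig code; the point is only that every pointer into the
per-line buffer is reset once that buffer is freed, so an empty or comment
line never sees stale values from the previous line.

```c
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

struct transport_cfg {
	const char *transport;
	const char *traddr;
	const char *trsvcid;
};

/* Point cfg members into the tokenised, heap-allocated argument buffer. */
static void parse_line(char *args, struct transport_cfg *cfg)
{
	for (char *tok = strtok(args, " \t"); tok; tok = strtok(NULL, " \t")) {
		if (!strncmp(tok, "--transport=", 12))
			cfg->transport = tok + 12;
		else if (!strncmp(tok, "--traddr=", 9))
			cfg->traddr = tok + 9;
		else if (!strncmp(tok, "--trsvcid=", 10))
			cfg->trsvcid = tok + 10;
	}
}

static void reset_transport_params(struct transport_cfg *cfg)
{
	cfg->transport = NULL;
	cfg->traddr = NULL;
	cfg->trsvcid = NULL;
}

/* Assumption: transport and traddr are the "basic" parameters. */
static bool have_basic_params(const struct transport_cfg *cfg)
{
	return cfg->transport && cfg->traddr;
}

/* Returns the number of lines that would be acted upon, or -1 on OOM. */
static int count_usable_lines(const char *const *lines, size_t n)
{
	struct transport_cfg cfg = { NULL, NULL, NULL };
	int usable = 0;

	for (size_t i = 0; i < n; i++) {
		size_t len = strlen(lines[i]) + 1;
		char *args = malloc(len);

		if (!args)
			return -1;
		memcpy(args, lines[i], len);

		if (args[0] != '#')		/* skip comment lines */
			parse_line(args, &cfg);
		if (have_basic_params(&cfg))
			usable++;		/* a real caller would connect here */

		free(args);
		reset_transport_params(&cfg);	/* cfg no longer points into args */
	}
	return usable;
}
```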
Martin Wilck [Thu, 25 Feb 2021 22:14:19 +0000 (23:14 +0100)]
fabrics: fix some memory leaks
None of these are critical for "nvme discover" or "nvme connect-all".
Still, silencing valgrind's error messages by fixing them gives some
peace of mind, and for a longer-running program like the forthcoming
nvme monitor, leak checks are more important.
Use the previously introduced cleanup macros for this purpose.
Martin Wilck [Tue, 12 Jan 2021 18:50:05 +0000 (19:50 +0100)]
nvme-cli: add generic logging functionality
Add a msg() macro that allows more flexible customization of logging
both at build time and at run time. Allow several log levels, using
the well-known standard syslog levels. Also optionally allow printing
of log timestamps.
Put '#define LOG_FUNCNAME' before '#include "util/log.h"' to enable printing
the name of the calling function before the log message.
Use this functionality in the fabrics code for now, wherever fprintf(stderr, ...)
had been used.
No functional change except changing the output channel of 554db7d ("print
device name when creating a persistent device") from stdout to stderr.
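A rough sketch of how such a macro could be built, assuming a run-time
threshold variable and a helper function. Only msg() and LOG_FUNCNAME come
from the description above; log_level, log_msg() and MSG_FUNC are
hypothetical names, the contents of the real util/log.h are not reproduced
here, and timestamps are left out for brevity.

```c
#include <stdarg.h>
#include <stdio.h>
#include <syslog.h>	/* LOG_ERR, LOG_NOTICE, LOG_DEBUG, ... */

/* Hypothetical run-time threshold: messages above this level are dropped. */
static int log_level = LOG_NOTICE;

/* Defining LOG_FUNCNAME before this point prefixes the caller's name. */
#ifdef LOG_FUNCNAME
#define MSG_FUNC __func__
#else
#define MSG_FUNC NULL
#endif

/* Returns the number of characters written to stderr (0 if filtered). */
__attribute__((format(printf, 3, 4)))
static int log_msg(int lvl, const char *func, const char *fmt, ...)
{
	va_list ap;
	int n = 0;

	if (lvl > log_level)	/* syslog levels: lower value = more severe */
		return 0;
	if (func)
		n += fprintf(stderr, "%s: ", func);
	va_start(ap, fmt);
	n += vfprintf(stderr, fmt, ap);
	va_end(ap);
	return n;
}

#define msg(lvl, ...) log_msg(lvl, MSG_FUNC, __VA_ARGS__)
```

With this shape, a call such as msg(LOG_ERR, "failed: %d\n", ret) replaces
fprintf(stderr, ...) one for one, and raising log_level to LOG_DEBUG enables
the more verbose messages without rebuilding.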
Martin Wilck [Thu, 4 Mar 2021 17:16:03 +0000 (18:16 +0100)]
nvme: add some simplifying macros for __attribute__((cleanup()))
Using __attribute__((cleanup())) is very helpful for writing leak-free
code, but it requires lots of awkward boilerplate code. Add some
small helpers to make its use more comfortable.
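As an illustration of the kind of helper meant here, the sketch below wraps
the GCC/Clang cleanup attribute in two macros. The macro and function names
(CLEANUP_FUNC, AUTO_FREE_STR, cleanup_charp(), dup_len()) are hypothetical
and not necessarily those added by the patch; frees_seen exists only to make
the automatic free observable.

```c
#include <stdlib.h>
#include <string.h>

static int frees_seen;	/* instrumentation for this example only */

/* Cleanup handlers receive a pointer to the annotated variable. */
static void cleanup_charp(char **p)
{
	if (*p)
		frees_seen++;
	free(*p);
}

#define CLEANUP_FUNC(fn) __attribute__((cleanup(fn)))
#define AUTO_FREE_STR CLEANUP_FUNC(cleanup_charp)

/* Duplicates s, measures the copy, and never has to call free() itself. */
static size_t dup_len(const char *s)
{
	char *copy AUTO_FREE_STR = malloc(strlen(s) + 1);

	if (!copy)
		return 0;
	strcpy(copy, s);
	return strlen(copy);	/* copy is freed when it goes out of scope */
}
```

The benefit shows on functions with several early returns: each exit path
releases the buffer without a dedicated "out:" label.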
Martin Wilck [Fri, 5 Mar 2021 20:19:10 +0000 (21:19 +0100)]
do_discover: free cfg.device when resetting it
cfg.device might have been allocated by a previous call to
find_ctrl_with_connectargs(), therefore free it. We must make sure
that cfg.device is always on the heap, thus change fabrics_discover()
accordingly.
If persistent controller connections are present they should be
preferred even if no --device option is given on the commandline.
To avoid selecting a temporary non-persistent controller the
'kato' attribute is checked; any controller for which the
attribute is '0' will be skipped. This logic is not applied on
older kernels that don't support the 'kato' attribute. On these
kernels, we just have to assume that the encountered discovery
controller is persistent.
Signed-off-by: Hannes Reinecke <hare@suse.de>
Signed-off-by: Martin Wilck <mwilck@suse.de>
Hannes Reinecke [Wed, 24 Mar 2021 06:39:31 +0000 (07:39 +0100)]
fabrics: fix infinite loop on invalid parameters
When parsing the discovery entries results in invalid parameters
for a given connection we'll enter an infinite loop as the -EINVAL
error code is always assumed to indicate a wrong 'disable_sqflow'
setting.
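One way to bound the retry is to repeat the connect attempt only while
'disable_sqflow' is still set, and to clear it before retrying. The sketch
below assumes that approach; connect_args, try_connect() and
connect_with_retry() are hypothetical stand-ins, not the fabrics.c code.

```c
#include <errno.h>
#include <stdbool.h>

struct connect_args {
	bool disable_sqflow;
	bool params_valid;	/* stand-in for "the other parameters are sane" */
};

/* Stand-in for the real connect: rejects bad parameters and, like a
 * target without SQ flow control support, rejects disable_sqflow. */
static int try_connect(const struct connect_args *a)
{
	if (!a->params_valid || a->disable_sqflow)
		return -EINVAL;
	return 0;
}

/* Retries at most once: -EINVAL is blamed on disable_sqflow only while
 * that option is still set. */
static int connect_with_retry(struct connect_args *a, int *attempts)
{
	int ret;

	*attempts = 0;
	for (;;) {
		ret = try_connect(a);
		(*attempts)++;
		if (ret != -EINVAL || !a->disable_sqflow)
			return ret;
		a->disable_sqflow = false;	/* retry without the option */
	}
}
```

With invalid parameters the second attempt fails again, but disable_sqflow is
already cleared, so the error is returned instead of looping.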
Sagi Grimberg [Mon, 15 Mar 2021 20:52:09 +0000 (13:52 -0700)]
nvme-topology: have list-subsys print only controllers with attached namespace
When running list-subsys on a specific namespace, we output all the
controllers that belong to the subsystem regardless of whether the requested
namespace is actually attached to them.
Example:
$ nvme list-subsys /dev/nvme0n1
nvme-subsys0 - NQN=nqn.2016-01.com.lightbitslabs:uuid:07cfffe6-5a4f-4151-b663-e13cf835609b
\
+- nvme0 tcp traddr=10.113.5.1 trsvcid=4420 live optimized
+- nvme1 tcp traddr=10.133.3.1 trsvcid=4420 live
+- nvme2 tcp traddr=10.133.1.1 trsvcid=4420 live
+- nvme3 tcp traddr=10.113.1.1 trsvcid=4420 live inaccessible
+- nvme4 tcp traddr=10.133.5.1 trsvcid=4420 live
+- nvme5 tcp traddr=10.113.4.1 trsvcid=4420 live
+- nvme6 tcp traddr=10.133.4.1 trsvcid=4420 live
+- nvme7 tcp traddr=10.113.2.1 trsvcid=4420 live
+- nvme8 tcp traddr=10.133.2.1 trsvcid=4420 live
+- nvme9 tcp traddr=10.113.3.1 trsvcid=4420 live
This output is somewhat confusing and makes the user think the namespace
is actually attached to all of these controllers. Instead, we want the
output provided by the change introduced here, which lists only the
controllers that have the namespace attached.
Do the same trick as we do in scan_subsystems: add the ctrl to the
topology if either ns_instance wasn't passed (see all controllers) or
it was passed _and_ this namespace is attached to the controller
(to do that we add nsid down the call chain).
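The selection rule can be sketched as a small predicate. This is an
illustration under assumptions: the struct layout and the function names are
hypothetical, and "no namespace requested" is encoded as nsid 0, which is not
necessarily how the real code represents a missing ns_instance.

```c
#include <stdbool.h>
#include <stddef.h>

#define MAX_NS 4

struct ctrl {
	const char *name;
	unsigned int nsids[MAX_NS];	/* attached namespace IDs, 0 = unused slot */
};

static bool ctrl_has_nsid(const struct ctrl *c, unsigned int nsid)
{
	for (size_t i = 0; i < MAX_NS; i++)
		if (c->nsids[i] == nsid)
			return true;
	return false;
}

/* List the controller if no namespace was requested (nsid == 0), or if
 * the requested namespace is attached to it. */
static bool ctrl_is_listed(const struct ctrl *c, unsigned int nsid)
{
	return nsid == 0 || ctrl_has_nsid(c, nsid);
}

static size_t count_listed(const struct ctrl *ctrls, size_t n,
			   unsigned int nsid)
{
	size_t listed = 0;

	for (size_t i = 0; i < n; i++)
		if (ctrl_is_listed(&ctrls[i], nsid))
			listed++;
	return listed;
}
```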
Gollu Appalanaidu [Sun, 14 Mar 2021 17:57:47 +0000 (23:27 +0530)]
nvme: address 1.4 to 1.4b changes for Change NS event type
In the NVMe 1.4 spec, the NCAP field of the Change Namespace Event
(Event Type 06h) data format in the persistent event log was assigned
16 bytes. In the 1.4b spec it is changed to 8 bytes.
Gollu Appalanaidu [Sun, 7 Mar 2021 18:32:35 +0000 (00:02 +0530)]
nvme-print: improve command support and effects log json format
Currently the command support and effects log page in json format
prints all opcodes, irrespective of whether the command is supported.
Most of the log page consists of "Unknown" opcodes; fix that. Also
add two json objects, "acs" and "iocs", to print them separately.
Martin George [Wed, 3 Mar 2021 06:19:19 +0000 (11:49 +0530)]
fabrics: ensure zero kato for non-persistent controllers
Add a helper function that sets the default kato value for the
discovery controllers. Along with ensuring a non-zero kato value
(i.e. NVMF_DEF_DISC_TMO) is passed to a persistent discovery
controller, it also ensures a zero kato value is passed to a
non-persistent discovery controller, as mandated by the NVMe spec.
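A sketch of what such a helper might look like. The struct, the field names
and the function name are hypothetical, and the value 30 for
NVMF_DEF_DISC_TMO is an assumption made for this example.

```c
#include <stdbool.h>

#define NVMF_DEF_DISC_TMO 30	/* seconds; value assumed for this sketch */

struct disc_cfg {
	bool persistent;
	int keep_alive_tmo;	/* 0 means "not set" */
};

/* Persistent discovery controllers get a non-zero kato (the default if
 * the user gave none); non-persistent ones always get zero. */
static void set_discovery_kato(struct disc_cfg *cfg)
{
	if (cfg->persistent && !cfg->keep_alive_tmo)
		cfg->keep_alive_tmo = NVMF_DEF_DISC_TMO;
	else if (!cfg->persistent && cfg->keep_alive_tmo > 0)
		cfg->keep_alive_tmo = 0;
}
```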
Gollu Appalanaidu [Mon, 1 Mar 2021 18:37:07 +0000 (00:07 +0530)]
nvme-print: fix HMB get feature response in human readable format
For the Host Memory Buffer (HMB) feature, the Memory Return (MR) bit
is not part of the CQE CDW0 in the Get Features response. Also add
endianness conversion for the Get Features HMB attribute data structure.
Gollu Appalanaidu [Sat, 27 Feb 2021 19:40:01 +0000 (01:10 +0530)]
nvme: fix security send and receive commands result field
Security Send and Receive commands don't fill the CQE CDW0;
remove the unnecessary passing of the result field in both commands.
The changes are made in RPMB related files since RPMB depends
on these commands. RPMB operations only carry a result field as part
of the data frame, and this result is completely different from the
CQE CDW0.
Hannes Reinecke [Thu, 25 Feb 2021 15:53:00 +0000 (16:53 +0100)]
fabrics: correctly handle ctrl_loss_tmo settings for loop
The previous fix had an issue with referrals, as it would take
the default values and apply them to all referral entries.
And if the default entries were for 'loop', the default ctrl_loss_tmo
setting would not be used, but rather '-1'.
So this patch reverts the previous patch and correctly blanks
out the ctrl_loss_tmo setting when constructing the connect string.
Fixes: bdf4f3b ("fabrics: ctrl_loss_tmo setting is invalid for 'loop'")
Signed-off-by: Hannes Reinecke <hare@suse.de>
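To illustrate "blanking out" the setting while constructing the connect
string, the sketch below appends ctrl_loss_tmo only for transports other than
'loop'. build_connect_args() and its reduced option set are hypothetical; the
real connect string carries more options than shown here.

```c
#include <stdio.h>
#include <string.h>

/* Builds "nqn=...,transport=...[,ctrl_loss_tmo=...]" into buf.
 * Returns 0 on success, -1 if buf is too small. */
static int build_connect_args(char *buf, size_t len, const char *transport,
			      const char *nqn, int ctrl_loss_tmo)
{
	int n = snprintf(buf, len, "nqn=%s,transport=%s", nqn, transport);

	if (n < 0 || (size_t)n >= len)
		return -1;

	/* ctrl_loss_tmo is invalid for 'loop', so leave it out entirely. */
	if (strcmp(transport, "loop") != 0) {
		int m = snprintf(buf + n, len - n, ",ctrl_loss_tmo=%d",
				 ctrl_loss_tmo);

		if (m < 0 || (size_t)m >= len - n)
			return -1;
	}
	return 0;
}
```

Because the decision is made per connect string, referral entries with a
different transport are not affected by the defaults of the original entry.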
These changes follow the Base NVMe 1.4b and NVMeOF 1.1 specifications.
The Optional Fabrics Command Support (OFCS) field was added at the wrong
offset of the structure; fix that and other naming conventions as per the spec.
Gollu Appalanaidu [Tue, 23 Feb 2021 18:58:45 +0000 (00:28 +0530)]
nvme: add support for lba status log page
This log page is used to provide information about
subsequent actions the host may take to discover which
logical blocks, in namespaces that are attached to
the controller, may no longer be recoverable when read.
For more details see NVM Express 1.4 Spec. Section
5.14.1.14 ("LBA Status Information (Log Identifier 0Eh)")
Gollu Appalanaidu [Sat, 20 Feb 2021 16:30:09 +0000 (22:00 +0530)]
nvme: add support for endurance group event aggregate log
This log page indicates if an Endurance Group Event has
occurred for a particular Endurance Group; the details of
the particular event are included in the Endurance Group
Information log page for that Group. For details see
NVM Express 1.4 Spec. Section 5.14.1.15 ("Endurance Group
Event Aggregate (Log Identifier 0Fh)")
Hannes Reinecke [Wed, 17 Feb 2021 16:42:43 +0000 (17:42 +0100)]
fabrics: ctrl_loss_tmo setting is invalid for 'loop'
The 'ctrl_loss_tmo' setting is invalid for 'loop' devices, so move
the default to '-1' (for 'unset'), and only set the default timeout
if discovery is not done on a loop device and the user has not
specified another value.
Fixes: 68bc869 ("fabrics: fix passing ctrl_loss_tmo=0 by default")
Signed-off-by: Hannes Reinecke <hare@suse.de>